AITopics

Country:

North America > Canada (0.04)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
Europe > Belgium > Flanders (0.04)

Industry:

Energy (1.00)
Banking & Finance > Real Estate (0.47)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.47)

Neural Information Processing SystemsDec-24-2025, 04:38:39 GMT

Introspective Learning : A Two-Stage approach for Inference in Neural Networks

introspective learning, name change, two-stage approach, (8 more...)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.63)

Neural Information Processing SystemsOct-2-2025, 22:17:08 GMT

Interior Point Solving for LP-based prediction+optimisation

Solving optimization problems is the key to decision making in many real-life analytics applications. However, the coefficients of the optimization problems are often uncertain and dependent on external factors, such as future demand or energy or stock prices. Machine learning (ML) models, especially neural networks, are increasingly being used to estimate these coefficients in a data-driven way.

artificial intelligence, machine learning, optimization problem, (19 more...)

Industry:

Energy (1.00)
Banking & Finance > Real Estate (0.47)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.47)

arXiv.org Artificial IntelligenceFeb-3-2025

Query Brand Entity Linking in E-Commerce Search

Liu, Dong, Nag, Sreyashi

Western brand name written in its original form versus its representation in Asian scripts), (ii) different surface forms for the same In this work, we address the brand entity linking problem for e-brand (e.g., abbreviations versus full names) and (iii) identifying commerce search queries. The entity linking task is done by either i) brand relationships between parent and sub-brands (e.g., a parent a two-stage process consisting of entity mention detection followed company and its product line brands). Therefore, in addition to by entity disambiguation or ii) an end-to-end linking approaches recognizing the brand names mentioned in the query, it is also that directly fetch the target entity given the input text. The task important to link them to the corresponding global brand entity. It presents unique challenges: queries are extremely short (averaging would be valuable to unify the concept of brand across different e-2.4 words), lack natural language structure, and must handle a commercial stores in a single namespace, i.e., brand entity (identity massive space of unique brands. We present a two-stage approach to each brand itself). Each brand entity is is unique across languages, combining named-entity recognition with matching, and a novel stores and surface forms. As part of this effort, we aim to recognize end-to-end solution using extreme multi-class classification.

brand entity, brand name, query, (14 more...)

2502.01555

Country:

North America > United States > California > San Francisco County > San Francisco (0.14)
North America > United States > New York > New York County > New York City (0.04)
North America > United States > District of Columbia > Washington (0.04)
(5 more...)

Genre: Research Report (0.40)

Technology:

Information Technology > Information Management > Search (1.00)
Information Technology > Artificial Intelligence > Natural Language > Text Processing (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Information Retrieval (0.89)

arXiv.org Artificial IntelligenceOct-20-2024

Multimodal Learning for Embryo Viability Prediction in Clinical IVF

Kim, Junsik, Shi, Zhiyi, Jeong, Davin, Knittel, Johannes, Yang, Helen Y., Song, Yonghyun, Li, Wanhua, Li, Yicong, Ben-Yosef, Dalit, Needleman, Daniel, Pfister, Hanspeter

In clinical In-Vitro Fertilization (IVF), identifying the most viable embryo for transfer is important to increasing the likelihood of a successful pregnancy. Traditionally, this process involves embryologists manually assessing embryos' static morphological features at specific intervals using light microscopy. This manual evaluation is not only time-intensive and costly, due to the need for expert analysis, but also inherently subjective, leading to variability in the selection process. To address these challenges, we develop a multimodal model that leverages both time-lapse video data and Electronic Health Records (EHRs) to predict embryo viability. One of the primary challenges of our research is to effectively combine time-lapse video and EHR data, owing to their inherent differences in modality. We comprehensively analyze our multimodal model with various modality inputs and integration approaches. Our approach will enable fast and automated embryo viability predictions in scale for clinical IVF.

artificial intelligence, machine learning, natural language, (15 more...)

2410.15581

Country:

Asia > Middle East > Israel > Tel Aviv District > Tel Aviv (0.04)
North America > United States > New York > New York County > New York City (0.04)
North America > United States > Massachusetts > Middlesex County > Cambridge (0.04)
(3 more...)

Genre: Research Report (0.50)

Industry:

Health & Medicine > Health Care Technology > Medical Record (0.55)
Health & Medicine > Therapeutic Area > Obstetrics/Gynecology (0.49)

Technology:

Information Technology > Artificial Intelligence > Natural Language (0.94)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.94)
Information Technology > Sensing and Signal Processing (0.68)

Neural Information Processing SystemsOct-11-2024, 00:29:21 GMT

Introspective Learning : A Two-Stage approach for Inference in Neural Networks

In this paper, we advocate for two stages in a neural network's decision making process. The first is the existing feed-forward inference framework where patterns in given data are sensed and associated with previously learned patterns. The second stage is a slower reflection stage where we ask the network to reflect on its feed-forward decision by considering and evaluating all available choices. We use gradients of trained neural networks as a measurement of this reflection. A simple three-layered Multi Layer Perceptron is used as the second stage that predicts based on all extracted gradient features.

introspective learning, neural network, two-stage approach, (5 more...)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Perceptrons (0.63)

arXiv.org Artificial IntelligenceAug-11-2024

Efficiently and Effectively: A Two-stage Approach to Balance Plaintext and Encrypted Text for Traffic Classification

Peng, Wei

Encrypted traffic classification is the task of identifying the application or service associated with encrypted network traffic. One effective approach for this task is to use deep learning methods to encode the raw traffic bytes directly and automatically extract features for classification (byte-based models). However, current byte-based models input raw traffic bytes, whether plaintext or encrypted text, for automated feature extraction, neglecting the distinct impacts of plaintext and encrypted text on downstream tasks. Additionally, these models primarily focus on improving classification accuracy, with little emphasis on the efficiency of models. In this paper, for the first time, we analyze the impact of plaintext and encrypted text on the model's effectiveness and efficiency. Based on our observations and findings, we propose a two-phase approach to balance the trade-off between plaintext and encrypted text in traffic classification. Specifically, Stage one is to Determine whether the Plain text is enough to be accurately Classified (DPC) using the proposed DPC Selector. This stage quickly identifies samples that can be classified using plaintext, leveraging explicit byte features in plaintext to enhance model's efficiency. Stage two aims to adaptively make a classification with the result from stage one. This stage incorporates encrypted text information for samples that cannot be classified using plaintext alone, ensuring the model's effectiveness on traffic classification tasks. Experiments on two datasets demonstrate that our proposed model achieves state-of-the-art results in both effectiveness and efficiency.

classification, encrypted text, traffic classification, (13 more...)

2407.19687

Country:

North America > United States > Minnesota > Hennepin County > Minneapolis (0.14)
North America > United States > Washington > King County > Seattle (0.04)
North America > Canada > Ontario > Toronto (0.04)
(13 more...)

Genre: Research Report (0.50)

Industry: Information Technology > Security & Privacy (1.00)

Technology:

Information Technology > Security & Privacy (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Dempsey, Ryan, Ethier, Jonathan

Clutter Classification Using Deep Learning in Multiple Stages

arXiv.org Artificial IntelligenceAug-8-2024

Path loss prediction for wireless communications is highly dependent on the local environment. Propagation models including clutter information have been shown to significantly increase model accuracy. This paper explores the application of deep learning to satellite imagery to identify environmental clutter types automatically. Recognizing these clutter types has numerous uses, but our main application is to use clutter information to enhance propagation prediction models. Knowing the type of obstruction (tree, building, and further classifications) can improve the prediction accuracy of key propagation metrics such as path loss.

classification, residential building, stage 1 0, (14 more...)

doi: 10.1109/SoutheastCon52093.2024.10500248

2408.04407

Country:

North America > Canada > Ontario > National Capital Region > Ottawa (0.04)
Oceania > Australia > Queensland > Brisbane (0.04)
North America > United States > Maryland > Montgomery County > Gaithersburg (0.04)
(3 more...)

Genre: Research Report (1.00)

Industry:

Health & Medicine (0.46)
Education (0.46)
Energy > Renewable > Geothermal > Geothermal Energy Exploration and Development > Geophysical Analysis & Survey (0.35)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.68)

arXiv.org Artificial IntelligenceJul-1-2024

The Solution for the ICCV 2023 Perception Test Challenge 2023 -- Task 6 -- Grounded videoQA

Zhang, Hailiang, Chao, Dian, Guan, Zhihao, Yang, Yang

In this paper, we introduce a grounded video question-answering solution. Our research reveals that the fixed official baseline method for video question answering involves two main steps: visual grounding and object tracking. However, a significant challenge emerges during the initial step, where selected frames may lack clearly identifiable target objects. Furthermore, single images cannot address questions like "Track the container from which the person pours the first time." To tackle this issue, we propose an alternative two-stage approach:(1) First, we leverage the VALOR model to answer questions based on video information.(2) concatenate the answered questions with their respective answers. Finally, we employ TubeDETR to generate bounding boxes for the targets.

perception test challenge 2023, video, yang yang, (12 more...)

2407.01907

Country: Asia > China > Jiangsu Province > Nanjing (0.05)

Genre: Research Report > New Finding (0.47)

Technology: Information Technology > Artificial Intelligence > Machine Learning (1.00)

Neural Information Processing SystemsMar-15-2024, 09:58:53 GMT

Simultaneous Sampling and Multi-Structure Fitting with Adaptive Reversible Jump MCMC

Multi-structure model fitting has traditionally taken a two-stage approach: First, sample a (large) number of model hypotheses, then select the subset of hypotheses that optimise a joint fitting and model selection criterion. This disjoint two-stage approach is arguably suboptimal and inefficient -- if the random sampling did not retrieve a good set of hypotheses, the optimised outcome will not represent a good fit. To overcome this weakness we propose a new multi-structure fitting approach based on Reversible Jump MCMC. Instrumental in raising the effectiveness of our method is an adaptive hypothesis generator, whose proposal distribution is learned incrementally and online. We prove that this adaptive proposal satisfies the diminishing adaptation property crucial for ensuring ergodicity in MCMC. Our method effectively conducts hypothesis sampling and optimisation simultaneously, and yields superior computational efficiency over previous two-stage methods.

hypothesis, inlier, minimal subset, (16 more...)

Country:

Oceania > Australia > South Australia > Adelaide (0.04)
Europe > United Kingdom > England > Oxfordshire > Oxford (0.04)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
Asia > Middle East > Jordan (0.04)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)